Design of optimal labeling patterns for optical genome mapping via information theory
نویسندگان
چکیده
Abstract Motivation Optical genome mapping (OGM) is a technique that extracts partial genomic information from optically imaged and linearized DNA fragments containing fluorescently labeled short sequence patterns. This can be used for various analyses applications, such as the detection of structural variations copy-number variations, epigenomic profiling, microbial species identification. Currently, choice patterns based on available bio-chemical methods, not necessarily optimized application. Results In this work, we develop model OGM theory, which enables design optimal labeling specific applications target organism genomes. We validated through experimental human simulations bacterial DNA. Our predicts up to 10-fold improved accuracy by patterns, may guide future development methods significantly improve its yield profiling cultivation-free pathogen identification in clinical samples. Availability implementation https://github.com/yevgenin/PatternCode
منابع مشابه
Design of Experiments via Information Theory
We discuss an idea for collecting data in a relatively efficient manner. Our point of view is Bayesian and information-theoretic: on any given trial, we want to adaptively choose the input in such a way that the mutual information between the (unknown) state of the system and the (stochastic) output is maximal, given any prior information (including data collected on any previous trials). We pr...
متن کاملFast and Cheap Genome Wide Haplotype Construction via Optical Mapping
We describe an efficient algorithm to construct genome wide haplotype restriction maps of an individual by aligning single molecule DNA fragments collected with Optical Mapping technology. Using this algorithm and small amount of genomic material, we can construct the parental haplotypes for each diploid chromosome for any individual. Since such haplotype maps reveal the polymorphisms due to si...
متن کاملWhole Genome Optical Mapping
An innovative new technology, optical mapping, is used to infer the genome map of the location of short sequence patterns called restriction sites. The technology, developed by David Schwartz, allows the visualization of the maps of randomly located single molecules around a million base pairs in length. The genome map is constructed from overlapping these shorter maps. The mathematical and com...
متن کاملApplication of Stochastic Optimal Control, Game Theory and Information Fusion for Cyber Defense Modelling
The present paper addresses an effective cyber defense model by applying information fusion based game theoretical approaches. In the present paper, we are trying to improve previous models by applying stochastic optimal control and robust optimization techniques. Jump processes are applied to model different and complex situations in cyber games. Applying jump processes we propose some m...
متن کاملinvestigating the feasibility of a proposed model for geometric design of deployable arch structures
deployable scissor type structures are composed of the so-called scissor-like elements (sles), which are connected to each other at an intermediate point through a pivotal connection and allow them to be folded into a compact bundle for storage or transport. several sles are connected to each other in order to form units with regular polygonal plan views. the sides and radii of the polygons are...
ذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Bioinformatics
سال: 2023
ISSN: ['1367-4811', '1367-4803']
DOI: https://doi.org/10.1093/bioinformatics/btad601